DOCUMENT RESUME

ED 212 286                                              IR 009 993

AUTHOR         Mandel, Carol A.
TITLE          Subject Access in the Online Catalog.
SPONS AGENCY   Council on Library Resources, Inc., Washington, D.C.
PUB DATE       Aug 81
NOTE           31p.
EDRS PRICE     MF01/PC02 Plus Postage.
DESCRIPTORS    *Cataloging; Databases; *Indexing; *Library Catalogs;
               Library Research; Man Machine Systems; *Online
               Systems; Subject Index Terms
IDENTIFIERS    Failure Analysis; Free Text Searching; Library of
               Congress Subject Headings; Preserved Context Indexing
               System; *User Needs

ABSTRACT

     This review of the research on subject access to library
collections focuses on the problems of and prospects for improved
online subject access to library collections. Summaries of the
general findings of studies on library catalog use and catalog
users and some reasons for the frequent failure of subject searches
in library catalogs are followed by a discussion of the use of
"failure analysis" as a technique in studies of automated
information retrieval systems. The advantages and disadvantages of
free text searching are reviewed, the feasibility of using the
Preserved Context Indexing System (PRECIS) to supplement Library of
Congress Subject Headings is briefly considered, and some of the
conclusions drawn from studies of library users' needs are
presented. Research on the enrichment of cataloging records using
free text descriptors, on enhancing currently used subject access
systems such as Library of Congress Subject Headings, and on
ensuring the effectiveness of the user interface with an online
catalog is also discussed. Six recommendations are made for the
improvement of subject access in online catalogs, and a 41-item
reference list is included. (JL)







Subject Access in the Online Catalog



A report prepared for the
Council on Library Resources
by

Carol Mandel
with the assistance of
Judith Herschman



August 1981



ACKNOWLEDGEMENTS

The authors wish to thank the many individuals who took the time
to talk with us and to share with us the results of their research
and experience. We are particularly grateful to Pauline Atherton
Cochrane, Laura Kassebaum, and Robert Zich for their help and
advice.



TABLE OF CONTENTS

Introduction
Research on Subject Searching in Library Catalogs:
Findings & Methods
    General Findings
    Reasons for Search Failures
    Failure Analysis
    Free Text vs. Controlled Vocabulary Searching
    PRECIS
    Studies of Users' Needs
Building on What is Known
    Research on Enriched Records
    Research on Search Failure
    Enhancing the Current Method of Subject Access
    Ensuring an Effective User Interface
Summary of Recommendations
References



INTRODUCTION

New technologies are often adapted to traditional uses without
fully exploiting added capabilities. To take full advantage of new
developments, careful planning is needed. Librarians and
information scientists are particularly conscious of the need to
apply rapid changes in computer and communication technologies to
expanding the ability to store, manipulate, and retrieve
information.

Librarians recognized early in their use of online cataloging
systems that the computer was not only a very sophisticated catalog
card production machine but a device for retrieving bibliographic
information in entirely new ways. The difficult question was, and
still is, not whether to exploit the computer for the benefit of
library users, but how. The question has been considered in
relation to a variety of library catalog formats during the 1970's,
but now it is clear that during the 1980's the most widely used
format will be direct user interaction with online public access
library catalogs. The nature of the use of the many bibliographic
databases that have been made accessible online through commercial
services indicates that computerized records and the software that
manipulates them permit much more powerful searching strategies
than do traditional card catalogs. This has prompted questions
about the kinds of searching techniques necessary and desirable in
online library catalogs and, more important, whether librarians are
seriously limiting their new potential by loading only traditional
bibliographic records into computerized catalogs.

* " * 

These questions are particularly pressing in relation to searching
for materials on a particular subject or for items for which the
author or title are only dimly remembered. Such materials are
sought by means of key words or subject terms; in traditional
American library catalogs this usually means the first word of a
title or a Library of Congress Subject Heading (LCSH). The latter
has been widely criticized by librarians and there is considerable
concern that the present limitations of the LC subject system
should not be carried over into the era of online library catalogs.
An often-quoted statement of the problem was made by Bates in 1977:

     If we simply transfer the austerity-based LC subject
     heading approach to expensive computer systems, then
     we have used our computers merely to embalm the
     constraints that were imposed on library systems back
     before typewriters came into use.[1]



This paper examines research on subject access in light of problems
of and prospects for providing online subject access to library
collections. "Successful" subject searching can only be defined in
terms of the objectives of the access system and the expectations
of the reader -- e.g., is an exhaustive bibliography desired, or
only a few select books on a topic. Since the emphasis of this
paper is on the kinds of access traditionally provided through
library catalogs, it is assumed that the objective of a subject
search in a library catalog would be: 1) to lead the reader from
the topics he or she has in mind to the relevant vocabulary terms
available in the catalog; 2) to provide the reader with records for
most (80%) of the books in the system on the topic in question (but
not necessarily to parts of books); and 3) to provide the reader
with enough information to decide whether or not to call for the
item identified by the search.

This is a very modest set of objectives and may not be acceptable
to librarians who believe it is important to provide in-depth
subject analysis to library users. However, this paper is concerned
not only with what constitutes the best possible means of subject
access, but with political and economic considerations that are
likely to affect decisions regarding future library catalogs. While
a variety of methods for retrieving subject information have been
used successfully in systems designed for specialized subject
areas, they cannot be applied directly and immediately to library
catalogs. The transition from present library methods of subject
analysis to new forms of access accepted and applied by libraries
will be a gradual process, accompanied by testing and
experimentation. The purpose of the paper is to suggest areas where
the Bibliographic Service Development Program might initiate or
support efforts that will help research libraries improve subject
access through online catalogs.



RESEARCH ON SUBJECT SEARCHING IN LIBRARY CATALOGS:
FINDINGS AND METHODS

General Findings

Studies of catalog use and catalog users provide an overview of who
uses the subject catalog, how often subject searches are successful
(using varying definitions for "success"), and how persistent
subject searchers are. The bulk of the research, of course,
describes the manual catalogs that have been the major method of
accessing library materials for over a century; however, some work
has been done on machine files as well. Even the research done on
manual card catalogs provides insights for planning online catalogs
because it is important to understand the use made of the library
bibliographic record. Library records contain a standardized,
limited set of data elements rather than the descriptors,
abstracts, and even full texts available in other kinds of files.

The basic findings may be summarized as follows.

1.  Subject heading searches are sometimes used to identify items
    already known to the searcher;[2] conversely, some "known-item"
    searches are searches only for subject information.[3]

2.  Although "known-item" searches account for more card catalog
    use than subject searches, the proportion of subject searches
    varies with the user population. Several studies demonstrate an
    inverse relationship between the amount of subject searching
    and the user's level of expertise. In a recent study at
    Dartmouth, only 28.6% of the faculty surveyed reported that the
    subject approach was the search method they used most often, as
    compared to 51.4% of the undergraduates questioned.[4] This may
    have changed significantly with the introduction of detailed
    subject searching in online catalogs at Dartmouth, but the new
    data have not yet been analyzed.[5]

3.  Users often select terms that are either too broad or too
    narrow.[6] Separate subject heading lists, such as the LCSH,
    are rarely used to identify terms for searching, even when the
    lists are placed near an online catalog terminal.

4.  About half of the terms used by readers in their first try at
    the subject catalog correspond to either a heading or a
    reference found in the catalog. If subsequent tries are
    included, the success rate rises to about 70%.[9]

5.  Not all users persist in subject searching until they are
    successful. Between two-thirds and three-quarters of the
    searches in manual subject catalogs, whether successful or not,
    do not continue beyond a single look-up.[10] There are some
    indications, however, that users might show greater
    perseverance when using an online catalog.[11]

6.  Online searching in a small database is considerably more
    successful, both in terms of number of relevant documents found
    and search time per useful document, when additional
    descriptive terms taken from indexes and tables of contents of
    books are added to the MARC record and made accessible.[12]
    This approach has not yet been tested in a large database.

7.  Library catalog users think that more access points, both
    subject headings and key words, should be added to records for
    books.[13] Standard library cataloguing practice currently
    results in an average of slightly under 1.5 LC subject headings
    per record.[15]



Reasons for Search Failures

Very little is known about the reasons why 50% of first attempts to
seek a term in the subject catalog fail, although there is general
consensus among librarians, and some evidence, that the lack of
specificity in LC subject terms and the lack of "see" references in
library catalogs are the major contributing factors. The 50%
"hit-rate" for terms used by the reader is prima facie evidence
that the entry vocabulary of library catalogs is inadequate. In
other words, the natural language that expresses readers' requests
is not mapped, either through cross references or sufficiently
convenient displays in the thesaurus used, to the terms appearing
in the library catalog. A rich entry vocabulary is not inexpensive
to maintain, but has been demonstrated to be cost-effective because
it greatly reduces the intellectual burden on both the cataloger
and the searcher.[16]
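
The idea of an entry vocabulary, in which readers' own words are
mapped through "see" references to the headings used in the catalog,
can be sketched in a few lines of Python. This is only an
illustration under stated assumptions: the headings and references
are invented examples rather than entries checked against LCSH, and
the function names resolve_term and hit_rate are hypothetical.

```python
# Hypothetical entry vocabulary: authorized headings plus "see" references
# that lead from readers' words to the heading actually used in the catalog.
AUTHORIZED_HEADINGS = {"Motion pictures", "Automobiles", "Cookery"}

SEE_REFERENCES = {
    "movies": "Motion pictures",
    "films": "Motion pictures",
    "cars": "Automobiles",
    "cooking": "Cookery",
}


def resolve_term(reader_term):
    """Return the catalog heading for a reader's term, or None on failure."""
    normalized = reader_term.strip().lower()
    # Direct hit: the reader happened to use the authorized heading.
    for heading in AUTHORIZED_HEADINGS:
        if heading.lower() == normalized:
            return heading
    # Indirect hit: a "see" reference maps the reader's word to a heading.
    return SEE_REFERENCES.get(normalized)


def hit_rate(reader_terms):
    """Fraction of reader terms that reach a heading directly or by reference."""
    if not reader_terms:
        return 0.0
    hits = sum(1 for term in reader_terms if resolve_term(term) is not None)
    return hits / len(reader_terms)
```

Every reference added to the table raises the share of first
attempts that reach a heading, which is the sense in which a richer
entry vocabulary shifts effort away from the searcher.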



Lack of specificity in LC subject terms as the cause of many
subject search failures is more difficult to demonstrate
conclusively. An early library catalog study and more recent
studies of information retrieval systems have shown that, in
general, material in subject areas with more abstract language
(e.g., education) is more difficult to access with precision than
items in areas with relatively "hard" languages (e.g.,
chemistry).[17] A study done by Lipetz at Yale, where LCSH are used
for subject access, found that "users engaged in subject searches
frequently complained that subject sections in the catalog are much
too large and general, rarely narrowed to cover only the particular
subject aspect of interest to the user."[18] The complaints of Yale
users are substantiated by a recent analysis of a sample of books
classed with LC classification and given LC subject headings. The
analysis demonstrated that in a number of classification ranges,
subject headings did not add appreciably to discriminating among
all of the items assigned the same class number. This led the
investigator to conclude that "in these areas the reader could do
just as well (or better) at the book shelf than they could in the
library's catalog."[19]

However, the frequently-voiced complaints regarding lack of
specificity in LCSH do not necessarily reveal that the actual
vocabulary of the list is the cause of the search failure. For
example, readers' requests often may be too specific to be met by
monographs indexed as a whole, even though American library
catalogs generally do not contain subject entries for parts of
books. This difference between library policy and readers' requests
could account, at least in part, for the demonstrated superiority
of a system that adds information derived from the indexes and
tables of contents of monographs. In addition to policy decisions,
flaws in indexing practice may account for searching failures.
After being shown a number of examples of overly-general terms
assigned by LC catalogers, Edward Blume (then head of the LC
Subject Cataloging Division) noted, "Subject headings can be
created as needed, but often catalogers choose not to do so. Many
of the bad examples of LC subject indexing cited by various
speakers are not examples of the limitations of the system as such,
but rather examples of extremely bad cataloging."[20]



Failure Analysis

The inability to specify the precise causes of failure in library
subject searching illustrates a methodology problem in research on
indexing systems recently described by Svenonius.[21] She points
out that comparisons or evaluations of systems are studies of
"aggregate variables," overlooking the separate elements that make
differing contributions to the success or failure of the system.
Even studies restricted to the indexing language miss specific
features that may account for essential differences. While
Svenonius's call for more theoretical research in this area is apt,
existing methods for evaluating indexing languages by comparing
terms in the system with actual request statements could also
provide useful results in studies of subject access in online
catalogs.[22] The online catalog provides the opportunity to
monitor the frequency of use of different types of search terms, to
observe which of the terms appearing in the database are used in
searching, and to enumerate and analyze those terms used in
searching that do not match terms accessible in the database.[23]
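
The kind of monitoring described in the preceding paragraph can be
sketched as a simple comparison of logged search terms against the
terms accessible in the database. The sketch below is hypothetical:
it assumes that a catalog can supply a plain list of the terms
searchers entered, and the function name analyze_search_log and the
structure of its result are inventions for illustration only.

```python
from collections import Counter


def analyze_search_log(search_terms, database_terms):
    """Summarize how logged search terms relate to terms in the database.

    search_terms: iterable of terms entered by searchers (one per look-up).
    database_terms: iterable of terms accessible in the database.
    """
    index = {term.lower() for term in database_terms}
    usage = Counter(term.strip().lower() for term in search_terms)

    matched = {term: n for term, n in usage.items() if term in index}
    unmatched = {term: n for term, n in usage.items() if term not in index}
    never_searched = index - set(usage)

    return {
        "matched": matched,                # database terms actually used
        "unmatched": unmatched,            # candidates for new cross references
        "never_searched": never_searched,  # database terms no one entered
    }
```

The "unmatched" counts are the raw material for failure analysis:
frequently entered terms that match nothing suggest where cross
references or additional access points are missing.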

» 

General performance measures used without analyzing the reasons for
search failures do not provide the information needed to make
decisions leading to improving a system. "Failure analysis" is
commonly done in studies of automated information systems since
machine searching can provide a step-by-step record of a search
without inconveniencing the searcher. Although the results of such
studies as reported in the literature are specific to the systems
under scrutiny,* King has noted that search failure can be expected
to fall into the following categories:

1.  failures of policy or practice in indexing;

2.  failures in the vocabulary used, usually due to lack of
    specificity or to ambiguous or spurious relationships between
    terms;

3.  failures in searching strategy;

4.  failure to reflect accurately the user's information need in
    the search.[24]

Thorough failure analysis involves examination of the documents
missed, indexing records, requests, search strategies, and the
users' relevance assessments. In-depth analysis also can be
informative when both successful and unsuccessful searches are
compared.

*It is easier to generalize the results of library catalog studies
because subject access mechanisms in libraries are fairly
standardized. However, differences among libraries, such as policy
in providing cross references, may often be underestimated.

Free Text vs. Controlled Vocabulary Searching

Many of the other techniques and performance measures that are well
developed for evaluating automated information systems, such as
measures of precision, estimated recall, and search time per
relevant document found, can be applied to evaluate subject
searches in both online and manual library catalogs. Studies
employing these methods might cast some light on the relative
merits of free text vs. controlled vocabulary searching on
bibliographic records for books, a question often debated by
librarians. The MUMS and SCORPIO systems at the Library of Congress
provide an ideal opportunity to compare the capabilities of two
different software systems, one limited to searching exact subject
headings (often phrases) and the other accessible by subject word.
Both are used to search one database. However, comparisons of free
text searching with controlled vocabulary searching have been
applied to a variety of databases and systems and invariably lead
to the same conclusions: a combination of both is best, with the
optimal mix dependent upon the specific features of the database,
the system, and the user's requirements. Vendors of database search
services have found that their customers demand the fullest range
of search possibilities available; BRS, Inc. has found that this
same request is made by the libraries for which it provides online
public catalogs.[25] In this case, the test of the marketplace has
indicated the desirability of flexible free text searching of
records for books.
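
The performance measures named at the start of this section have
simple arithmetic definitions, shown here as a minimal Python
sketch. The function name evaluate_search and its parameters are
assumptions made for illustration; in practice the set of relevant
records is itself an estimate, which is why the text speaks of
"estimated recall".

```python
def evaluate_search(retrieved, relevant, search_minutes):
    """Compute simple performance measures for one search.

    retrieved: set of record identifiers returned by the search.
    relevant: set of identifiers judged relevant (in practice an estimate).
    search_minutes: elapsed search time in minutes.
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant

    # Precision: share of what was retrieved that is relevant.
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    # Recall: share of the relevant records that was retrieved.
    recall = len(hits) / len(relevant) if relevant else 0.0
    # Undefined when nothing relevant was found; report None in that case.
    minutes_per_hit = search_minutes / len(hits) if hits else None

    return {"precision": precision, "recall": recall,
            "minutes_per_relevant": minutes_per_hit}
```

Running the same requests against a heading-only system and a
word-searchable system, and comparing these three figures, is one
way the comparison suggested above could be made concrete.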

Developing software that permits free text searching is a simple
task in comparison with providing the data elements to be searched.
Library bibliographic records are not rich in searchable words.
Other than the words in the title of a book, and very occasionally
a contents note, few useful terms exist in the traditional library
record beyond those added as specific access points by a cataloger.
Atherton created a database of MARC records enriched by descriptive
terms from the tables of contents and indexes of the books
represented. She found that the enriched database, referred to as
BOOKS, was clearly superior for subject searching. Atherton's work
has been widely publicized and well received, yet libraries have
not made efforts to enhance their bibliographic records as she
suggested. One possible explanation for this inaction is that
practitioners believe that further testing and demonstration of the
value of such enrichment is necessary. Another is that the
enhancement process adds a workload that cannot be absorbed
economically. If this is the case, it would be useful to know
whether it is possible to sacrifice the LC subject headings
presently being applied in exchange for the uncontrolled vocabulary
terms available in the BOOKS database. This could be done by
analyzing the results of the BOOKS searches and eliminating matches
made only on the LC subject heading portion of the record. Economic
realities may make "trade-ins" a more realistic possibility than
acquiring an additional vehicle for subject access.

PRECIS

A "trade-in" often debated by subject catalogers is the
substitution of PRECIS strings for Library of Congress subject
headings on catalog records produced at LC. At the request of the
ALA Subject Analysis Committee, the Library conducted a study in
1977 to test the feasibility of adding PRECIS strings to LC
records. The relative merits of the two systems for use by those
seeking information were not addressed. Such a comparative study,
although difficult to design, would be of interest to many
librarians. However, the conclusions drawn by the Library of
Congress in 1977 indicate there is no pressing need to conduct such
a study. The Subject Cataloging Division determined that

     there has been no public demand that the Library of
     Congress either replace the traditional Library of
     Congress subject headings with PRECIS, nor to add PRECIS
     strings to traditional catalog cards or MARC tapes....
     In view of the fact that the addition of PRECIS strings
     to all current cataloging... would cost approximately
     $1,000,000 per year and that there has been no demand
     to do this, the Library of Congress will not seek money
     from Congress or from any other source to maintain
     two subject heading/indexing systems.[26]

The LC study is an important reminder that steps taken to improve
subject access must realistically assess relevant economic and
political considerations.

Studies of Users' Needs

Quantitative measures of existing systems for subject searching
also need to be supplemented by behavioral science methodology to
provide qualitative assessments of the needs, perceptions, and
level of satisfaction of the systems' users.[27] A major study
being conducted by the OCLC Research Department has employed the
focused-group interview technique to determine "library users'
perceptions, expectations, and criteria for success in using the
subject catalog."[28] The study is intended to provide the
designers of online public catalogs with descriptions of the
features that will support and enhance the present subject search
tactics of library users. Although detailed analyses are presently
being applied to the results of 200 individual interviews and 13
group interviews, the investigators have already been able to
describe some of the desirable features of an online subject
catalog. These include:

1.  additional access points, including key words in titles and
    added subject headings describing both the whole book and its
    chapters;

2.  online display of a thesaurus to help searchers choose broader,
    narrower, and related terms;

3.  the ability to define searches with Boolean logic;

4.  the ability to delimit searches by: a) date, b) inclusion or
    exclusion of conference proceedings, c) level of understanding
    required by the reader, d) fiction/nonfiction, e) language;

5.  transparent (i.e., automatic) translation from the users'
    natural language to the terms used in the catalog;

6.  additional descriptive information culled from the book (e.g.,
    table of contents) that would permit browsing at the terminal
    rather than in the stacks to make relevance judgments.

It is worth noting that all of these features have been used
successfully to improve subject access in a variety of information
databases. Librarians designing online catalogs are in the
fortunate position of being able to capitalize on the developments
and experiments made during the past decade in providing online
access to specialized databases.
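
Two of the features listed above, Boolean logic (item 3) and
delimiting by date and language (item 4), can be illustrated with a
short Python sketch. It is a simplification under stated
assumptions: records are plain dictionaries with invented field
names ("title", "subjects", "year", "language"), and the search
function is hypothetical rather than a description of any existing
catalog software.

```python
def record_words(record):
    """Key words from the title plus words from the subject headings."""
    text = " ".join([record["title"]] + record["subjects"])
    return {w.strip(".,;:()").lower() for w in text.split()}


def search(records, all_of=(), any_of=(), none_of=(),
           year_from=None, language=None):
    """Boolean AND / OR / NOT on key words, delimited by date and language."""
    results = []
    for rec in records:
        words = record_words(rec)
        if not all(t.lower() in words for t in all_of):      # AND
            continue
        if any_of and not any(t.lower() in words for t in any_of):  # OR
            continue
        if any(t.lower() in words for t in none_of):          # NOT
            continue
        if year_from is not None and rec["year"] < year_from:
            continue
        if language is not None and rec["language"] != language:
            continue
        results.append(rec)
    return results
```

A request such as "catalogs, but not card catalogs, published since
1975, in English" then becomes a single call combining all_of,
none_of, year_from, and language.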



BUILDING ON WHAT IS KNOWN 



Although there is much more research to be done on subject access,
the preceding section also indicates that a great deal more is
known than has been applied in library catalogs. A two-fold
approach is needed to plan for the future: 1) continuing research
to determine the most effective means of subject access that can be
used in libraries, and 2) taking action to improve and enhance
established methods of subject access.



Research on Enriched Records 



Word-by-word (i.e., free text) searching throughout the record as
well as subject heading (i.e., controlled vocabulary) searching of
added entries has been enthusiastically used by most online catalog
customers served by BRS. Delimiting searches by data elements in
the record and applying Boolean and positional operators also make
the most of data in bibliographic records. But many librarians
argue that there simply are not enough descriptive words in
standard records for monographs to permit adequate subject
retrieval. The average bibliographic record for a monograph
contains between one and two subject headings; journal articles
indexed in common reference tools are given considerably more
descriptors.[29] There is no rational intellectual justification
for the discrepancy in the "index-term per page" ratio for books
and articles; the explanation lies in library economics and
priorities. The most successful use of free-text searching occurs
in databases containing abstracts or full texts, not just
bibliographic citations. The possibilities for enriching databases
of library records range in a continuum from methods requiring
considerable effort or expense (e.g., preparing an abstract for
each work cataloged) to minimal enhancements added automatically.

Atherton demonstrated that 300 additional descriptive words taken
directly from the contents of a book and added to a MARC record
considerably enhanced access to library materials. Although she
devised an efficient method for this enrichment, it appears that no
library or group of libraries is willing to pay the price of the
added labor at this time. A much more modest method of generating
additional descriptors might be to use a program to add to MARC
records the appropriate terms from the LC classification schedules
whenever certain class numbers appeared in the record.[30] As
Atherton suggests, continued research is needed to test for the
most effective and most practical enhancements to records. In
addition, new ideas will need to be tested for acceptability in the
library community and modified accordingly. Academic libraries,
faced with budget cuts, are not likely to adopt cataloging
practices that require additional labor. In fact, the last decade
has shown greater reliance than in the past on standard Library of
Congress records in most libraries. Convincing libraries that they
need to enrich their records for subject access will require a
large body of research and a well argued plan to show that the
benefits of such enrichment outweigh the costs.
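
The "much more modest method" mentioned above, a program that adds
terms from the classification schedules to a record whenever a
given class number appears, might look something like the following
Python sketch. Everything in it is an assumption made for
illustration: the record is a plain dictionary rather than a real
MARC structure, the field names are invented, and the caption table
is a made-up excerpt that has not been checked against the actual
LC schedules.

```python
# Invented excerpt of a class-number-to-caption table; a real program
# would load captions from the LC classification schedules.
CLASS_CAPTIONS = {
    "Z695": ["subject cataloging"],
    "Z699": ["machine methods of information storage and retrieval"],
}


def enrich_record(record, captions=CLASS_CAPTIONS):
    """Return a copy of the record with schedule terms added as descriptors."""
    class_number = record.get("class_number", "")
    # Use the class letters and whole number only, e.g. "Z695" from "Z695 .M36".
    stem = class_number.split(".")[0].replace(" ", "")
    added = captions.get(stem, [])
    enriched = dict(record)
    existing = list(record.get("descriptors", []))
    # Append schedule terms that are not already present on the record.
    enriched["descriptors"] = existing + [t for t in added if t not in existing]
    return enriched
```

Because the lookup is purely mechanical, an enhancement of this kind
adds no cataloging labor per record, which is the property that
distinguishes it from the enrichment Atherton tested.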

Research on Search Failure

As discussed in the first part of this paper, the reasons current
library catalogs fail to respond to subject search requests are not
adequately understood. For example, it may be that the fundamental
policy underlying library subject cataloging -- that of providing
only subject headings coextensive with the entire book -- is at
fault. Search failures must be analyzed (see section entitled
"Failure Analysis") if users' actual demands on the library catalog
are to be understood and systems to meet these demands designed.

Studies of online catalogs (such as those now under way) provide
opportunities to gain insight into the causes for failure in
subject searches through diagnostic analyses of a sample of search
requests. This would require tracking the searches and results,
asking two or three skilled searchers (e.g., reference librarians)
to repeat the searches, showing any additional documents discovered
to the reader for a relevance assessment, and analyzing the reasons
why the additional documents were missed by the library user. The
results will help individual libraries assess their own systems; if
a pattern in the results pinpoints problems in indexing, indexing
language, entry vocabulary, or general problems in searching
strategies, the results will prove useful to the entire library
community. If failure analysis is not built into the online catalog
studies now in process, special studies should be undertaken.



Enhancing the Current Method of Subject Access

Although its precise contribution to successful searching varies by
system, a controlled vocabulary continues to be a valuable
component in subject access systems. The controlled vocabularies of
the many specialized bibliographic databases available online
differ widely, a desirable condition for accessing material in
special fields, but one that makes searching the databases
difficult to master. Research library catalogs cover a wide range
of subjects and most share a single, general vocabulary -- LC
Subject Headings. An enormous investment has been made in these
headings; there are literally millions of these subject terms
embedded in bibliographic record databases in the U.S. and Canada,
and the Library of Congress is committed to supplying them for the
bibliographic records it creates. (French-Canadian users of the
University of Toronto Cataloging System, UTLAS, are even
translating LC subject headings used in MARC records into French.)
Online catalogs must help searchers take advantage of LC subject
headings.

\ * 

Librarians- have learned that effective searching*jn subject 
catalogs, requires consulting "the red book", the thesaurus of LC 
subject terms-. Yet other catalog users rarely use this tooT-^ 
only 5% of SCORPIO searchers consult the list placed near LC's 
public access terminals. 31 Online interactive displays of 
thesaurus terms have proven successful in a ^number of retrieval 
systems and the OCLC study has found users to be enthusiastic 
about the idea of using such "an aid.* Thus conversion of the LC 
-snrbjeet— 4-ist to an online thesaurus, mounted by networks, 
utilities, vendors, and other providers of online catalpgs and 
capable of interrogation online by library users throughout the 
country, would add a powerful searching tool to online catalogs. 

*Kaske and Sanders reported resistance by a few scholars to the
idea. These few scholars feared the "rigid conceptual relationships"
that they thought the online display would enforce. One suspects
these scholars are reacting to the implications of the phrase "tree
of knowledge" that was used by the researchers to describe the
displays, and that the scholars would, in fact, find it useful to
consult a "thesaurus."

How might this conversion be accomplished? The answer
depends partly on whether to aim for restructuring the terms in
the LC list into a fully hierarchical arrangement or to settle
for the more modest objective of a thorough editorial revision of
the LCSH cross reference structure to bring it up to the standard
of current thesaurus construction. Such an editorial revision
was suggested by Angell in 1972 as a means of making the manual
list more useful.32 With the advent of the online catalog, the
revision would lay the groundwork for an interactive display of
narrower terms (NT, currently "see also" references in LCSH),
related terms (RT, currently "see also" references in LCSH), and
broader terms (BT, currently "see also from" notes in LCSH not
made into references), in addition to "use" (currently "see")
references and "used for" notes (currently "see from" notes).
The addition of scope and history notes would greatly compensate
for changes in terminology and for the common (and inevitable)
practice of refining and narrowing subject headings without
retrospectively subdividing files. By instructing searchers that
earlier material on their specific topics may be found under
certain broader terms, such notes would expand the recall power
of the system. The revision might also include the addition of
many specific references that are not now spelled out in the
printed LCSH lists.
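
The reference types named above can be pictured as fields on a
single authority record. The following is only a minimal sketch in
Python, not a description of any existing LC or UTLAS file format:
the class name Heading, its field names, the display function, and
the sample heading with its references are all hypothetical
illustrations and have not been checked against LCSH.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Heading:
    """One authorized subject heading with thesaurus-style references.

    All field names are assumptions made for this sketch.
    """
    term: str
    used_for: List[str] = field(default_factory=list)  # UF ("see from" notes)
    broader: List[str] = field(default_factory=list)   # BT
    narrower: List[str] = field(default_factory=list)  # NT
    related: List[str] = field(default_factory=list)   # RT
    scope_note: str = ""                               # SN, scope or history note


def display(heading: Heading) -> str:
    """Format a heading the way an online thesaurus screen might list it."""
    lines = [heading.term]
    if heading.scope_note:
        lines.append(f"  SN  {heading.scope_note}")
    labelled = (
        ("UF", heading.used_for),
        ("BT", heading.broader),
        ("NT", heading.narrower),
        ("RT", heading.related),
    )
    for label, terms in labelled:
        for term in sorted(terms):
            lines.append(f"  {label}  {term}")
    return "\n".join(lines)


# Illustrative data only; not taken from the printed LCSH list.
example = Heading(
    term="Solar heating",
    used_for=["Sun heating"],
    broader=["Heating"],
    narrower=["Solar water heaters"],
    related=["Solar energy"],
    scope_note="Earlier material may be found under the broader term Heating.",
)
print(display(example))
```

The scope note in the sample record shows how a history note could
direct searchers to a broader term for earlier material, which is
the recall benefit described in the paragraph above.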

Restructuring LC terms into a hierarchical arrangement, a
considerably more ambitious project, is also possible. A true
hierarchical structure would facilitate experiments in vocabulary
switching from LCSH to other thesauri as well as efforts to add
terms to LCSH for use in special subject areas. It would also
ensure that the cross references displayed to users reflect
accurate conceptual relationships between terms and would allow
users to select terms from a hierarchical display.

The product of either a revision or a reconstruction (in
either case, the actual subject terms used need not be changed)
would be a database of subject authority information that would
be displayed to both searchers and catalogers on online
terminals. In addition to enhancing searching, the display would
aid* catalogers in identifying terms to apply and would ease
future editing and maintenance of LCSH.



*Catalogers find the UTLAS display of LC subject authorities so
useful that a group of UTLAS participants cooperatively code and key
all information from each issue of the LCSH Additions and Changes
into the UTLAS subject authority file.33



Displaying the existing LCSH headings and references will
not, in itself, solve the problem of an entry vocabulary that
matches only half of users' first tries. Even without making a
single change in an LC subject term, access to the terms could be
improved enormously by adding to the entry vocabulary, i.e.,
adding "see" references. This can only be accomplished through a
system of continuous user feedback; only the individuals at the
user end of a subject system can monitor and suggest those terms
that are needed. This is not the fault of the cataloger, but is
inherent in the process of working from a given book. In a study
at the University of Chicago, library users were shown books and
asked what subject heading they would supply for the titles.
Participants did not check the catalog or a list of LC terms, yet
very few of the terms they proposed were not on the LC list.34
In other words, the list is adequately designed to match books in
hand. But readers carry out research on problems rather than on
books or topics,35 coming to the catalog with an unpredictable
vocabulary. Only a compilation of references drawn from actual
requests will be rich enough to meet their needs. Online library
catalogs provide an opportunity to capture samples of actual
request language, although reference librarians are in a position
to offer suggestions and feedback as well. This kind of feedback
is collected regularly from the users of database search
services: the ERIC Vocabulary Improvement Program provides a
model for collecting and editing suggestions for new and revised
terms as well as for references.36
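
One way an online catalog might capture request language is to
tally the search terms that match nothing in the current entry
vocabulary and pass the most frequent ones to an editor as
candidate "see" references. The sketch below, in Python, is an
illustration under stated assumptions only: the function names,
the min_count threshold, and the sample requests are hypothetical,
and no particular catalog system records its searches this way.

```python
from collections import Counter


def normalize(term):
    """Lower-case a term and collapse runs of whitespace."""
    return " ".join(term.lower().split())


def candidate_see_references(requests, entry_vocabulary, min_count=2):
    """Return (term, count) pairs for request terms that matched nothing.

    requests         -- terms exactly as users typed them (assumed input)
    entry_vocabulary -- headings and existing "see" references
    min_count        -- hypothetical threshold for passing a term to an editor
    """
    known = {normalize(term) for term in entry_vocabulary}
    normalized = [normalize(request) for request in requests]
    misses = Counter(term for term in normalized if term not in known)
    return [(term, n) for term, n in misses.most_common() if n >= min_count]


# Illustrative data only.
requests = ["Solar power", "solar  power", "Heart attack",
            "Solar energy", "heart attack", "Windmills"]
vocabulary = ["Solar energy"]
for term, count in candidate_see_references(requests, vocabulary):
    print(count, term)
```

The output of such a tally is only raw material; as the text notes,
suggestions still need to be sorted and edited before they become
references.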



Ensuring an Effective User Interface 






Tracking and diagnosing search strategies may reveal some
predictable patterns in user behavior, but the process of
research itself is largely creative, intuitive, and
unpredictable. Library users "thinking out loud" into tape
recorders during searches of the card catalog at Ohio State
University revealed "an amazing jumble of luck, inspiration, and
some knowledge ... triggered by flipping through cards...."37
Because the research process is one of building continually on
the answers to questions posed and resolved by the researcher,
Swanson has emphasized the importance of "creating a structure or
framework within which searchers themselves can exercise maximum
ingenuity and resourcefulness."38 In designing online library
catalogs, the best aid to subject (and other) searching may be
the program through which the user interacts directly with the
system, the "user-friendly interface." Descriptive information
in the database and powerful software to search it cannot be used
to full potential unless the interface is easy to learn.


Currently, a researcher using more than one library (a common
occurrence) does not need to learn how to open a catalog drawer
or flip through the cards each time he or she enters a new
system. But even "log-on" and "scrolling" procedures are likely
to vary among online catalogs. While commercial online services
suffer from unfortunate diversity,39 libraries have a unique
opportunity, right now, to standardize command language, search
procedures, and other elements of the user interface for public
catalogs before independent systems proliferate. The BSDP,
presently coordinating research efforts by libraries employing or
planning online catalogs, is in an excellent position to
encourage standards development by the participating
institutions.

There is no doubt that subject access will be better in the
online catalog than in existing library card catalogs. The
question is, how much better? In part, the answer depends on
actions taken now.




SUMMARY OF RECOMMENDATIONS

1. Understanding the relationship between library bibliographic
records and researchers' subject searches.

In American research libraries, the bibliographic record,
including terms added as subject descriptors, is a standardized
product. Yet the library profession has not documented how this
product is used in subject searches or why searches succeed or
fail. Gross surveys of rates of success or failure only begin to
answer the questions and do not indicate how improvements can be
made. Diagnostic studies of actual searches done by samples of
reader populations (e.g., senior faculty in a social science
discipline, graduate students in a humanities area, etc.) should
be undertaken, using methodologies well developed for evaluations
of information systems. There is no easy, speedy way to gather
the large body of detailed data needed, but the enormous U.S.
library investment in standard bibliographic records makes the
need for such data pressing.

2. Enriching the search terms available in library bibliographic
records.

Library bibliographic records do not include the abstracts
and long lists of descriptors that are searchable in many
information systems. Atherton's "Books are for Use Project"
demonstrated that enriched MARC records hold considerable promise
for improved subject access. Her recommendations for continued
research to determine the optimum number and kinds of terms added
to records should be followed. As methods for producing expanded
MARC records are devised, plans for cooperative efforts to do so
should be developed and tested for acceptability in the library
community. The methodology used by Information Systems
Consultants, Inc., in developing for ARL a cooperative plan to
expand access to microforms could be applied to planning for a
cooperative plan to expand subject access.

The Subject Access Project, funded by the Council on Library
Resources in 1978, demonstrated the potential of "enriched"
bibliographic records for subject searching. Another CLR-funded
project carried out by William Mischo in 1979 used enriched
records to provide in-depth access to a reference collection
through computer-generated KWOC indexes.40 Council support for
grant proposals in this area should continue and expand. BSDP
could encourage such research by developing and publicizing
guidelines encouraging researchers to submit proposals on
pragmatic methods for enhanced subject retrieval.

3. Enriching the entry vocabulary for subject searches.

A rich entry vocabulary is an important component of a
successful subject access system. The entry vocabulary currently
used in most American libraries is based upon the cross
references provided in the Library of Congress Subject Headings
(LCSH). While any library is free to expand upon these
references, few have allotted the extra staff time needed.

Any expanded entry vocabulary to LC subject terms should be
made available for use in American library catalogs, especially
online catalogs where the switching from entry term to actual
term can be done automatically. The expanded vocabulary should
be: 1) based on the actual language of readers' requests, and 2)
developed cooperatively. One possible way to create a file of
such references would be through user input to the Library of
Congress.
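
Automatic switching from an entry term to the authorized term
amounts to a table lookup. The Python sketch below is offered only
as an illustration of that idea: the function names and the sample
"see" references are hypothetical, the matching is a plain
normalized exact match, and a real catalog would need to handle
subdivisions, spelling variants, and much larger files.

```python
def normalize(term):
    """Lower-case a term and collapse runs of whitespace."""
    return " ".join(term.lower().split())


def build_switch_table(headings, see_references):
    """Map every known entry term to its authorized heading.

    headings       -- authorized headings (each maps to itself)
    see_references -- (entry_term, authorized_heading) pairs
    """
    table = {normalize(heading): heading for heading in headings}
    for entry_term, heading in see_references:
        # An authorized heading is never overridden by a reference.
        table.setdefault(normalize(entry_term), heading)
    return table


def switch(query, table):
    """Return the authorized heading for a query, or None if unknown."""
    return table.get(normalize(query))


# Illustrative data only; not taken from the printed LCSH list.
table = build_switch_table(
    headings=["Solar energy", "Myocardial infarction"],
    see_references=[("Solar power", "Solar energy"),
                    ("Heart attack", "Myocardial infarction")],
)
print(switch("heart  ATTACK", table))
print(switch("Windmills", table))
```

A query that returns None is exactly the kind of unmatched request
language that could be recorded for later review by the people
editing the reference file.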

The Library of Congress has indicated its interest in
participating in a project to test a mechanism for and the
utility of collecting user suggestions for new LCSH terms and
additional cross references.41 The test project would collect
suggested terms from several libraries over a set period of time.
The terms would be collected in a uniform manner at each
institution with project supervisors at each of the libraries to
sort and edit suggestions before submitting them to LC. The
suggested terms would then be reviewed and analyzed by the
Library of Congress. An advisory committee might be formed to
meet with LC staff to determine the conclusions and
recommendations indicated by the results of the project.

4. Getting the most from the controlled vocabulary.

A controlled vocabulary is likely to remain an essential
component of library subject access for some time to come.
Trends in current library operations also indicate that LCSH will
be the controlled vocabulary used by most American libraries as
long as LCSH terms are provided on LC MARC records. While major
changes in LCSH terms would be burdensome to American libraries
already editing their catalogs to conform to AACR2, a
reconfiguration of the LC list into an online thesaurus would
create a reference tool of great benefit to both libraries and
readers. In addition, the thesaurus could form the basis for
expanding the list in special fields or for machine translations
into other vocabularies.

The Library of Congress is interested in improving the LCSH
cross reference structure in conjunction with the Library's plans
to automate and distribute its subject authority file. As
previously noted, a range of possibilities for improvement
exists, running from a limited editorial project to a recasting
of the list into a hierarchical thesaurus. The Library would
like to participate in a small working meeting of no more than
ten experts in subject access and thesaurus construction for
automated systems. The objective of the meeting would be to
explore in detail alternatives for editing or restructuring LCSH
(without revising the subject terms used) and to define the
project(s) required to reach the most feasible and desirable end
product.



5. Developing and promoting standards for the user interface in
the online library catalog.

It is recommended that the BSDP form a working group of
librarians currently involved in the development of online public
catalogs. The group members would define those elements of the
interface that should be standardized (e.g., command language) and
begin to develop standards to which their emerging catalogs could
adhere. Further standards development might then be turned over
to an ANSI Z39 subcommittee, such as the Z39 group now working on
terms and symbols in retrieval systems.




REFERENCES 



1. Bates, Marcia. (1977). "Factors Affecting Subject Catalog
Search Success." Journal of the American Society for
Information Science 28, 161-169.

2. Krikelas, James S. (1980-81). "Searching the Library
Catalog -- A Study of Users' Access." Library Research 2,
215-230.

3. Lipetz, Ben-Ami. (1970). "User Requirements in Identifying
Desired Works in a Large Library." Yale University Library,
New Haven, Conn. (ED 042 479).

4. Fayen, Emily Gallup. (1980). "A Survey of Dartmouth College
Library Card Catalog Users." Unpublished report. Dartmouth
College Library, Hanover, N.H.

5. Fayen, Emily, telephone interview, March 31, 1980.

6. Kaske, Neal K., and Sanders, Nancy P. (1980). "On-line
Subject Access: The Human Side of the Problem." Reference
Quarterly 20, 52-58.

7. Kaske, Op. cit.

8. Pritchard, Sarah M. (1981). "SCORPIO: A Study of the Public
Users of the Library of Congress Information System."
Unpublished report. Library of Congress, Washington, D.C.

9. Haftner, Ruth. (1979). "The Performance of Card Catalogs:
A Review of Research." Library Research 1, 199-22?.

10. Bates, Op. cit.






11. McFadden, Thomas. (1981). RIT Interface Project, Phase 2:
Interim Progress Report. Unpublished report. Rochester
Institute of Technology, Rochester, N.Y.
Fayen reports similar observations at Dartmouth.

12. Atherton, Pauline. (1978). "Books Are For Use: Final Report
of the Subject Access Project to the Council on Library
Resources." Syracuse University School of Information Studies,
Syracuse, N.Y.

13. Kaske, Op. cit.

14. O'Neill, Edward T., and Aluri, Rao. (1981). "Library of
Congress Subject Heading Patterns in OCLC Monographic Records."
Library Resources and Technical Services 25(1), 63-80.

15. McClure, Charles R. (1976). "Subject and Added Entries as
Access to Information." Journal of Academic Librarianship
2(1), 9-14.

16. Lancaster, F. W. (1978). "The Cost-Effective Analysis of
Information Retrieval and Dissemination Systems." In Key
Papers in the Design and Evaluation of Information Systems
(D. King, ed.), pp. 23-38. Knowledge Industry Publications,
White Plains, N.Y.

17. Lancaster, F. W. (1977). The Measurement and Evaluation of
Library Services. Information Press, Washington, D.C., p. 30.

18. Lipetz, Op. cit., p. 65.

19. Atherton, Op. cit., p. 35-36.

20. International PRECIS Workshop. (1977). The PRECIS Index
System: Principles, Applications, and Projects; Proceedings
of the International PRECIS Workshop, University of Maryland,
October 15-17, 1976 (H. Wellish, ed.), p. 197. H.W. Wilson
Co., New York, N.Y.

21. Svenonius, Elaine. (1981). "Directions for Research in
Indexing, Classification, and Cataloging." Library Resources
and Technical Services 25(1), 88-103.

22. King, Donald W., and Bryant, Edward C. (1971). The
Evaluation of Information Services and Products, pp. 150-152.
Information Resources Press, Washington, D.C.

23. Brenner, Lisa P., et al. (1980-81). "User-Computer
Interface Designs for Information Systems: A Review." Library
Research 2, 63-73.

24. King, Op. cit., pp. 184-185.

25. Kassebaum, Laura, personal interview, February 17, 1981.

26. Library of Congress Subject Cataloging Division. (1978).
PRECIS Project. Unpublished report. Library of Congress,
Washington, D.C.

27. Swanson, Rowena Weiss. (1975). "Design and Evaluation of
Information Systems." In Annual Review of Information
Science and Technology 10 (C. Cuadra and A. Luke, eds.),
pp. 43-101. American Society for Information Science,
Washington, D.C.

28. Kaske, Neal K., and Sanders, Nancy P. (1980). "Evaluating
the Effectiveness of Subject Access: the View of the Library
Patron." In Communicating Information: Proceedings of the
ASIS Annual Meeting 17, 1980, pp. 323-325. Knowledge
Industry Publications, White Plains, N.Y.





29. McClure, Op. cit.

30. Wellish, Hans. (1972). "Subject Retrieval in the Seventies --
Methods, Problems, Prospects." In Subject Retrieval in the
Seventies: New Directions; Proceedings of an International
Symposium, University of Maryland, May 14-15, 1971 (H. Wellish
and T.D. Wilson, eds.), pp. 2-27. Greenwood Publishing Co.,
Westport, Conn.

The utility of this suggestion can be easily tested by checking
the language of a sample of reader requests against both LCSH
and the classification schedules. Requests collected during the
online catalog studies now being coordinated by the Council on
Library Resources could be used.

31. Pritchard, Op. cit.

32. Angell, Richard S. (1972). "Library of Congress Subject
Headings -- Review and Forecast." In Subject Retrieval in the
Seventies: New Directions; Proceedings of an International
Symposium, University of Maryland, May 14-15, 1971 (H. Wellish
and T.D. Wilson, eds.), pp. 143-167. Greenwood Publishing
Co., Westport, Conn.

33. Rood, Joanna, telephone interview, April 16, 1981.

34. Swanson, Don R. (1972). Requirements Study for Future Library
Catalogs: Final Report to the National Science Foundation.
University of Chicago Graduate Library School, Chicago, Ill.

35. Swanson, Don R. (1979). "Libraries and the Growth of Knowledge."
Library Quarterly 49(1), 3-25.

36. Booth, Barbara. (1979). "A 'New' ERIC Thesaurus, Fine-Tuned
for Searching." Online, 1979, 20-29.






37. Herndon, Gail A., and Van Pulis, Noelle. (1979). "The On-line
Library: Problems and Prospects for User Education." In New
Horizons for Academic Libraries; Papers Presented at the First
National Conference of the Association of College and
Research Libraries, Boston, Nov. 8-11, 1978 (R. Stueart and R.
Johnson, eds.), pp. 539-544. K. G. Saur, New York, N.Y.

38. Swanson, Don R. (1979) Op. cit.

39. Atherton, Pauline. (1978). "Standards for a User-System
Interface Language in On-line Retrieval Systems." Online Review
2(1), 57-61.

40. Mischo, William H. (1979). "Expanded Access to Reference
Collection Materials." Journal of Library Automation 12(4),
338-354.

41. The recommendations in sections 3 and 4 are based on a meeting
at the Library of Congress on April 15, 1981. Participants were
Carol Mandel, Mary K. D. Pietris, Lucia Rather, and Robert
Zich. At that meeting, Ms. Rather indicated her willingness
to work with the Council on Library Resources in implementing
the recommendations proposed.




